Bayesian parameter estimation and adaptive control of Markov processes with time-averaged cost
نویسندگان
چکیده
منابع مشابه
Bounded Parameter Markov Decision Processes Bounded Parameter Markov Decision Processes
In this paper, we introduce the notion of a bounded parameter Markov decision process as a generalization of the traditional exact MDP. A bounded parameter MDP is a set of exact MDPs speciied by giving upper and lower bounds on transition probabilities and rewards (all the MDPs in the set share the same state and action space). Bounded parameter MDPs can be used to represent variation or uncert...
متن کاملThe effect of parameter estimation on Phase II control chart performance in monitoring financial GARCH processes with contaminated data
The application of control charts for monitoring financial processes has received a greater focus after recent global crisis. The Generelized AutoRegressive Conditional Heteroskedasticity (GARCH) time series model is widely applied for modelling financial processes. Therefore, traditional Shewhart control chart is developed to monitor GARCH processes. There are some difficulties in financial su...
متن کاملObserver-Side Parameter Estimation For Adaptive Control
of Thesis Presented to the Graduate School of the University of Florida in Partial Fulfillment of the Requirements for the Degree of Master of Science OBSERVER-SIDE PARAMETER ESTIMATION FOR ADAPTIVE CONTROL By Jason Nezvadovitz August 2017 Chair: Warren Dixon Major: Mechanical Engineering In adaptive control, a controller is precisely designed for a certain model of the system, but that model’s...
متن کاملStochastic Adaptive Control via Consistent Parameter Estimation
The paper introduces novel techniques for achieving global convergence results and improving transient performance in stochastic adaptive control. For example, there is introduced switching between a least squares and an extended least squares parameter estimation algorithm according to an ill-conditioning measure, and there is appropriate selection of external persistently exciting signals, th...
متن کاملParameter estimation aspects in adaptive control
This paper provides a solution to the convergence of the parameter estimates to their true values in an ideal adaptive regulation context. The key design feature consists in the use of an asymptotically vanishing internally generated exciting sequence. 1. Introduction IN THIS paper, the following question is addressed: given a known order linear time invariant system, how does one design an ada...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Applicationes Mathematicae
سال: 1998
ISSN: 1233-7234,1730-6280
DOI: 10.4064/am-25-3-339-358